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A short proof of the Marchenko-Pastur theorem. 

Pavel Yasko 

Abstract 

We prove the Marchenko-Pastur theorem for random matrices with i.i.d. rows and a general dependence 
structure within the rows by a simple modification of the standard Cauchy-Stieltjes resolvent method. 
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1 Introduction 

Let X pn be a p x n random matrix whose columns {x p fc}fc=i are i.i.d. copies of some random vector x p in M p 
for all p,n ^ 1. All random elements are defined on the same probability space. The object of our study is 
fipn, the empirical spectral distribution (ESD) of n -1 X pn Xj n . Here ESD of a p x p real symmetric matrix A 
is defined by 

1 p 

%— 1 

where 6\ stands for the Dirac mass at A S 1 and Ai ^ ^ A p are eigenvalues of A. 

Recall that the Marchenko-Pastur law /r c with parameter c > 0 is the probability distribution 

(1 “ V c ) + A + x )( x — °lj( x € [ fli 5]) dx, 

2lTCX 

where x + = max{x, 0} for x E R, a = (1 — y/c) 1 2 , and b = (1 + y r c) 2 . 

The Marchenko-Pastur theorem states that, for any p = p(n) with p/n —> c > 0 as n —> oo, 

P(^pn -A P'c in distribution, n -A oo) = 1 (1) 

if each x p has centred orthonormal entries {X p k] p k=1 satisfying certain conditions. 
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The standard conditions include the independence of {X pk } p =1 and the Lindeberg condition 

1 p 

lim EX pk I(\X pk \ > £y/p) = 0 for all e > 0 (2) 

p-kx> p fe=i 

(see Theorem 3.10 in [3]). Bai and Zhou [2], Pastur and Pajor [6], and Pastur and Scherbina |7] (see Theorem 
19.1.8) proved the Marchenko-Pastur theorem, assuming that Var(xJA p x p /p) —>• 0, p —>• oo, for all sequences 
of p X p complex matrices A p with uniformly bounded spectral norms ||A p ||. If entries of x p are independent, 
this assumption is much stronger than d2|). 

In this note we give a short proof of the Marchenko-Pastur theorem under weaker conditions that cover 
all mentioned results. 

2 Main results 

Consider the following assumption. 

(A) (xjA p Xp — tr (Ap))/p A- 0 as p —» oo for all sequences of p x p complex matrices A p with uniformly 
bounded spectral norms ||A p ||. 

Theorem 2.1. If (A) holds, then (pQ) holds. 

If entries of x p are orthonormal, then Ex p A p x p = tr(A p ) and the assumption considered in |2j, [6], [7j (see 
Introduction) is stronger than (A). In addition, we have the following proposition. 

Proposition 2.1. Let {X pk } k=1 be independent random variables with EX pk = 0, EX^ k = 1 for each p ^ 1. 
Then © holds if and only if (A) holds for x p = (X p i ,..., X pp ), p ^ 1. 

Assumption (A) also covers the case, where entries of x p are orthonormal infinite linear combinations (in 
Lf) of some i.i.d. random variables with Ee k = 0 and Ee| = 1 (see Corollary 4.9 in arXiv:1410.5190). 

Remark. We get an equivalent reformulation of (A) if we replace complex matrices by real symmetric positive 
semi-definite matrices (see Lemma 5.3 in arXiv:1410.5190). 

3 Proofs 

Proof of Theorem 12.11 We will use the Cauchy-Stieltjes transform method. By the Stieltjes continuity 
theorem (e.g., see Exercise 2.4.10(i) in [8]), we only need to show that s n (^) —>• s(z) a.s. for all z € C with 
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Iiri(z) > 0, where s n = s n (z) and s = s(z) are the Stieltjes transforms of p pn and p c defined by 


Sn 



Ppn (d\) 

X — z 


and s(z) 


Vc(dA) 

A — z 


By the definition of p pn , s n (z) = tr(ra _1 X pn X pn — zl p )~ l /p for the p x p identity matrix I p . 

Fix any C with v = Im(z) > 0. By the standard martingale argument (e.g., see Step 1 in the proof 
of Theorem 1.1 in |2J or Lemma 4.1 in ID), we derive that s n (z) — Es n (z) —>• 0 a.s. We finish the proof by 
checking that Es n (T) —> s(z). We need a technical lemma. 

Lemma 3.1. Let C be a real symmetric positive semi-definite p x p matrix and x E M p . If z G C is such 
that v = Im( 2 ) > 0, then (1) ||(C — zl p )~ l \\ ^ 1/v, (2) |tr(C + xx T — zl p ) _1 — tr(C — zl p )~ 1 \ ^ 1/v, (3) 

| x T (C + xx T — zl p )~ l x )| ^ 1 + \z\/v, (4) Im (2 + ztx{C — zip) -1 ) ^ v and Im(tr(C — zl p )~ l ) > 0, (5) 
Im (2 + zx T (C — zl p )~ 1 x) ^ v. 

All bounds in Lemma 13. II are well-known. Part (1) can be proved by diagonalizing C. Part (2) is given in 
Lemma 2.6 in j4|. Part (3) follows from the Sherman-Morrison formula and Part (5), since 


x T (C + xx 1 —zl p ) 1 x = x'(C — zI p ) 1 x — 


T 


T/ 


(x'(C-zI p ) 1 x) 2 


1 + x T (C — zl p ) l x 


= 1 - 


z + zx T (C — zI p ) l x 


Parts (4)-(5) can be checked by showing that Im(tr((l/ 2 )C I — I p ) 1 ) ^ 0 and Im(x T ((l/z)C — I p ) x x) ^ 0. 
Take x p = x Pin+1 to be independent of the matrix X pn and distributed as its columns {x p fc}£ =1 . Define 


n+1 


A. n Xp n Xp n ^ ^ ^-pk^-pk B n A n + XpXp ^ ^ ^-pk^-pk’ 


k =1 


k =1 


By Lemma HOT 1). B n — znl p is non-degenerate and 


n+l 

p = tr {{B n - znI p )(B n - znlp)- 1 ) = ^xj fc (fi n - 2 n/ p )~ 1 x pfc - zntr(B n - znlp)- 1 

k =1 

Taking expectations and using the exchangeability of {x p fc}^jt|, 


p =(n + l)ExJ (B n — znlp) 1 x p — znEtr(B n — znl p ) 1 . 


( 3 ) 
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Define S n (z) = tr(A n — znl p ) 1 and note that S n (z) = ( p/n)s n (z ). By Lemma libTT 2)-(3). 

Etr (B n — znlp)^ 1 = E5 n (z) + 0(l/n) and ExJ(B ra — zn/p) _1 x p = 0(1). 


Moreover, we will show below that 


ExJ(B„ - znl p )-'x p = + °(D 


( 4 ) 


Suppose for a moment that (j4]) holds (and p/n = c + o(l)). Then ([3]) reduces to 


VS n (z) 

1 + E S n (z) 


zES n (z) 


c + o(l). 


By (1) and (4) in Lemma 13.11 (E5 n (z))£T 1 is a bounded sequence with Im(E5 n (z)) > 0. It is straightforward 
to check that the limiting quadratic equation 5/(1 + S) — zS = c or zS 2 + (z — 1 + c)S + c = 0 has a unique 
solution 5 = S(z) with Im(5(z)) ^ 0 when Im (z) > 0. As a result, any converging subsequence of bounded 
sequence (E5 n (z))^L 1 tend to S(z). This implies that E5 n (2) = (p/n)Ks n (z) S(z). 

One can also show that S(z) = cs(z) is the above unique solution, where s(z ) is the Stieltjes transform 
of the Marchenko-Pastur law (see Remark 1.1 in m )• Combining all above relations, we conclude that 
s n (z) —>• 5 ( 2 ) a.s. 

To finish the proof, we only need to check (j4|) . By the Sherman-Morrison formula, 


xj (B n - znlp) l x p = xj (An + x p xj 


znL 


\-i. 


xj (A n - znlp) 1 x p 
1 + xj (A n - znlp )“ 1 x p 


Using Lemma [3TTT 11. (A), and the independence of x p and A n , we get xj (A n — znl p ) 1 x p — S n (z) A 0. In 
addition, as it is shown above, 


S n (z) - ES n (z) = (p/n)(s n (z) - Es n (z)) A 0. 


Hence, Lemma [3TlT 4)-(5) and inequality |1 + w\ ^ Im (z + zw)/\z\, w € C, yield 


x p (A n - znlp) x x p 


E5 n (z) 


1 + xj (An - znlp ) x x p 1 + E S n (z) 


\ z \2 

< A^\ X J (An - znIp)~ l yL p - E5 n (z)| A 0. 
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Finally, (QJ) follows from Lebesgue’s dominated convergence theorem and Lemma 13.11 3). 


□ 


Proof of Proposition 12.11 For each p ^ 1, let A p = (ajy •_ 1 be a complex p x p matrix with ||^4 p || ^ 1. 


If D p is a diagonal matrix with diagonal entries (ajp then 


E 


X J (A p — D p )x p " ^ 2E| a kj X pkX pj “ + 2E| ^ a { gx pk X pj ' = 2 ^ |ag| 2 ^ 4tr(A p ,4;), 


(p) 


. (p) 12 








where Jl* is the complex conjugate of A p . By the definition of the spectral norm, tr(^4 p ^4*) ^ ||t4 p || 2 p. Thus, 

x p (Ap — Dp)x p p^ 
p 

To finish the proof, we need to show that d2j) holds if and only if 


1 P 

- ^ a^\Xp k — 1) A 0 for any triangular array {a^\ 1 ^ k < p,p ^ 1} with \a^\ ^ 1. (5) 


P 


k =1 


Let ([5]) hold. Then Z p = p 1 Yl P =±Xp k A 1. Since EZ p = 1 and Z p ^ 0 a.s., the Vitali convergence 
theorem yields E|Z p — 11 —>• 0. Using inequalities p~ l Ylk=\ ^\ X pk — 1| ^ 2, p ^ 1, we derive from [5] that 


1 P 

- E\X* k - l\I(\Xp k - 1| > ep) -► 0 for all £ > 0. 

P _ i 


fc=i 


Obviously, this is equivalent to ©. 

Let ([2]) hold. By the Marcinkiewicz-Zygmund inequality, there is a universal constant C > 0 such that 


E | ^ E4 p) (vj -1)|« A(E< x p‘ - 1 > : 

1 k =1 P k =1 


1/2 


Using ([2]1 . Jensen’s inequality, and y/x + y ^ yfx + y/y, x,y ^ 0, we get 


E 


p p p 

(Y,( x2 P k - 1 ) 2 ) i/2 - !) 2/ (i x 2 Pk -iK k ) 1/2 + t^n x p k - m\ x2 P k -11 > 


k =1 


k = 1 


fc=l 


^pV2s + o(p) 
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for all e > 0, where we also applied the bound E{X 2 k — l) 2 I(\X 2 k — 1| < sp) < epE\X 2 k — 1| ^ 2 ep. Therefore, 


lim e|- 

p —^oo | p 


k =1 


^ y/2e. 


Tending e to zero, we get (|5|). 


□ 
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